A study on the natural-sounding Japanese phonetic word synthesis by using the VCV-balanced word database that consists of the words uttered forcibly in two types of pitch accent
نویسندگان
چکیده
In order to synthesize natural-sounding Japanese phonetic words, a novel VCV-concatenation synthesis with an advanced word database is proposed. The word database consists of VCVbalanced phonetic words which are uttered forcibly in type-0 and type-1 pitch accents. The advantage of using the advanced word database is that a variety of VCV-segments with the same phonetic chains and the different pitch patterns could be collected efficiently at the same time. The following pitch modification techniques are used to achieve the sound quality: (1) The optimal VCV-segment set which minimizes the pitch modification rate is selected. (2) Pitch waveforms are extracted by referring to excitation points. (3) Wavelengths of pitch waveforms are adjusted depending on the pitch modification rates. (4) Natural prosody in the VCV-segments in the database is effectively used. Superiority of the proposed database is ensured by means of the pitch pattern matching measurement and the subjective quality evaluation.
منابع مشابه
تکیه در زبان فارسی
Abstract: This research has been carried out in the framework of Auto segmental-metrical (AM) phonology to study the stress in Persian. Two types of abstract and concrete prominences were distinguished in which the first one refers to the stress and the second one refers to the pitch accent. Stress is assumed to be a lexical property of the lexemes, but pitch accent is assumed to be an intonati...
متن کاملAccent Sandhi Estimation of Tokyo Dialect of Japanese Using Conditional Random Fields
When synthesizing speech from Japanese text, correct assignment of accent nuclei for input text with arbitrary contents is indispensable in obtaining naturally-sounding synthetic speech. A phenomenon called accent sandhi occurs in utterances of Japanese; when a word is uttered in a sentence, its accent nucleus may change depending on the contexts of preceding/succeeding words. This paper descri...
متن کاملProsodic transfer in L2 relative prominence distribution: the case study of Japanese pitch accent produced by Italian learners
Relative prominence distribution, one of the major factors characterizing speech rhythm, is largely determined not only by the position of word accent/stress (word accent, henceforth) but also by the treatment of the acoustic correlates involved in word accent production (e.g., duration, F0, amplitude). Languages differ in both aspects, and those differences are expected to cause prosodic trans...
متن کاملWord segmentation in Persian continuous speech using F0 contour
Word segmentation in continuous speech is a complex cognitive process. Previous research on spoken word segmentation has revealed that in fixed-stress languages, listeners use acoustic cues to stress to de-segment speech into words. It has been further assumed that stress in non-final or non-initial position hinders the demarcative function of this prosodic factor. In Persian, stress is retract...
متن کاملPitch Accent in Japanese: Implementation by the C/D Model
In Tokyo Japanese, lexical accent is implemented by pitch pattern control, while phrasal stress patterns, along with pitch variation, convey non-lexical information in discourse. The C/D model represents pitch control by the tonal melody and stress control by the skeletal organization of the utterance. Phonetic implementation of pitch contours is exemplified here for different lexical accent pa...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1998